Psychophysical and physiological evidence for viewer-centered object representations in the primate.

نویسندگان

  • N K Logothetis
  • J Pauls
چکیده

A key question concerning the perception of 3D objects is the spatial reference frame used by the brain to represent them. The celerity of the recognition process could be explained by the visual system's ability to quickly transform stored models of familiar 3D objects, or by its ability to specify the relationship among viewpoint-invariant features or volumetric primitives that can be used to accomplish a structural description of an image. Alternatively, viewpoint-invariant recognition could be realized by a system endowed with the ability to perform an interpolation between a set of stored 2D templates, created for each experienced viewpoint. In the present study we set out to examine the nature of object representation in the primate in combined psychophysical-electrophysiological experiments. Monkeys were trained to recognize novel objects from a given viewpoint and subsequently were tested for their ability to generalize recognition for views generated by mathematically rotating the objects around any arbitrary axis. The perception of 3D novel objects was found to be a function of the object's retinal projection at the time of the recognition encounter. Recognition became increasingly difficult for the monkeys as the stimulus was rotated away from its familiar attitude. The generalization field for novel wire-like and spheroidal objects extended to about +/- 40 degrees around an experienced viewpoint. When the animals were trained with as few as three views of the object, 120 degrees apart, they could often interpolate recognition for all views resulting from rotations around the same axis. Recordings from inferotemporal cortex during the psychophysical testing showed a number of neurons with remarkable selectivity for individual views of those objects that the monkey had learned to recognize. Plotting the response of neurons as a function of rotation angle revealed systematic view-tuning curves for rotations in depth. A small percentage of the view-selective cells responded strongly for a particular view and its mirror-symmetrical view. For some of the tested objects, different neurons were found to be tuned to different views of the same object; the peaks of the view-tuning curves were 40-50 degrees apart. Neurons were also found that responded to the sight of unfamiliar objects or distractors. Such cells, however, gave nonspecific responses to a variety of other patterns presented while the monkey performed a simple fixation task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Handbook of Pattern Recognition and Computer Vision, Pp. 863{882 Viewer-centered Representations in Object Recognition: a Computational Approach

Visual object recognition is a process in which representations of objects are used to identify those objects in images. Recent psychophysical and physiological studies indicate that the visual system uses viewer-centered representations. In this chapter a recognition scheme that uses viewer-centered representations is presented. The scheme requires storing only a small number of views to repre...

متن کامل

Viewer-centered Object Recognition in Monkeys

How does the brain recognize three-dimensional objects? An initial step towards the understanding of the neural substrate of visual object recognition can be taken by studying rst the nature of object representation , as manifested in behavioral studies with humans or non-human primates. One fundamental question is whether these representations are object or viewer centered. We trained monkeys ...

متن کامل

Egocentric Spatial Representation in Early Vision

Recent physiological experiments have shown that the responses of many neurons in V1 and V3a are modulated by the direction of gaze. We have developed a neural network model of the hierarchy of maps in visual cortex to explore the hypothesis that visual features are encoded in egocentric (spatiotopic) coordinates at early stages of visual processing. Most psychophysical studies that have attemp...

متن کامل

Pixels, voxels, and views: A study of shape representations for single view 3D object shape prediction

The goal of this paper is to compare surface-based and volumetric 3D object shape representations, as well as viewer-centered and object-centered reference frames for single-view 3D shape prediction. We propose a new algorithm for predicting depth maps from multiple viewpoints, with a single depth or RGB image as input. By modifying the network and the way models are evaluated, we can directly ...

متن کامل

Viewpoint-dependent recognition of familiar faces.

The question whether object representations in the human brain are object-centered or viewer-centered has motivated a variety of experiments with divergent results. A key issue concerns the visual recognition of objects seen from novel views. If recognition performance depends on whether a particular view has been seen before, it can be interpreted as evidence for a viewer-centered representati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Cerebral cortex

دوره 5 3  شماره 

صفحات  -

تاریخ انتشار 1995